Automatic Classification of Structured Product Labels for Pregnancy Risk Drug Categories, a Machine Learning Approach

نویسندگان

  • Laritza Rodriguez
  • Dina Demner-Fushman
چکیده

With regular expressions and manual review, 18,342 FDA-approved drug product labels were processed to determine if the five standard pregnancy drug risk categories were mentioned in the label. After excluding 81 drugs with multiple-risk categories, 83% of the labels had a risk category within the text and 17% labels did not. We trained a Sequential Minimal Optimization algorithm on the labels containing pregnancy risk information segmented into standard document sections. For the evaluation of the classifier on the testing set, we used the Micromedex drug risk categories. The precautions section had the best performance for assigning drug risk categories, achieving Accuracy 0.79, Precision 0.66, Recall 0.64 and F1 measure 0.65. Missing pregnancy risk categories could be suggested using machine learning algorithms trained on the existing publicly available pregnancy risk information.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multi-label Classification of Product Reviews Using Structured Svm

Most of the text classification problems are associated with multiple class labels and hence automatic text classification is one of the most challenging and prominent research area. Text classification is the problem of categorizing text documents into different classes. In the multi-label classification scenario, each document is associated may have more than one label. The real challenge in ...

متن کامل

Text Mining and Classification of Product Reviews Using Structured Support Vector Machine

Text mining and Text classification are the two prominent and challenging tasks in the field of Machine learning. Text mining refers to the process of deriving high quality and relevant information from text, while Text classification deals with the categorization of text documents into different classes. The real challenge in these areas is to address the problems like handling large text corp...

متن کامل

Title: A Supervised Machine Learning Framework for the Extraction of Drug-Drug Interactions from Structured Product Labels Authors and affiliations:

Background: Information about drug-drug interactions (DDIs) is found in the medical literature and in drug package inserts published on DailyMed in addition to commercial drug databases. Objectives: To develop a machine learning framework for the extraction of DDIs from structured product labels (SPLs). Methods: We develop a supervised machine learning framework (support vector machine classifi...

متن کامل

Automatic road crack detection and classification using image processing techniques, machine learning and integrated models in urban areas: A novel image binarization technique

The quality of the road pavement has always been one of the major concerns for governments around the world. Cracks in the asphalt are one of the most common road tensions that generally threaten the safety of roads and highways. In recent years, automated inspection methods such as image and video processing have been considered due to the high cost and error of manual metho...

متن کامل

Preventing adverse drug events by extracting information from drug fact sheets

Background: The increasing volume and growing complexity of drugs lead to an increased risk of prescription errors and adverse events. A correct drug choice must be modulated to acknowledge both patients’ status and drug-specific information. This information is reported in free-text on drug fact sheets. It is often overwhelming and difficult to access. There is thus a rising need for generatin...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • AMIA ... Annual Symposium proceedings. AMIA Symposium

دوره 2015  شماره 

صفحات  -

تاریخ انتشار 2015